Graphix-T5: Mixing Pre-trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing
نویسندگان
چکیده
The task of text-to-SQL parsing, which aims at converting natural language questions into executable SQL queries, has garnered increasing attention in recent years. One the major challenges parsing is domain generalization, i.e., how to generalize well unseen databases. Recently, pre-trained text-to-text transformer model, namely T5, though not specialized for achieved state-of-the-art performance on standard benchmarks targeting generalization. In this work, we explore ways further augment T5 model with components parsing. Such are expected introduce structural inductive bias parsers thus improving model’s capacity (potentially multi-hop) reasoning, critical generating structure-rich SQLs. To end, propose a new architecture GRAPHIX-T5, mixed augmented by specially-designed graph-aware layers. Extensive experiments and analysis demonstrate effectiveness GRAPHIX-T5 across four benchmarks: SPIDER, SYN, REALISTIC DK. surpasses all other T5-based significant margin, achieving performance. Notably, GRAPHIX-T5-large reaches superior original T5-large 5.7% exact match (EM) accuracy 6.6% execution (EX). This even outperforms T5-3B 1.2% EM 1.5% EX
منابع مشابه
Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks
Although content is fundamental to our music listening preferences, the leading performance in music recommendation is achieved by collaborative-filtering-based methods which exploit the similarity patterns in user’s listening history rather than the audio content of songs. Meanwhile, collaborative filtering has the well-known “cold-start” problem, i.e., it is unable to work with new songs that...
متن کاملSomething Old, Something New - Applying a Pre-trained Parsing Model to Clinical Swedish
Information access from clinical text is a research area which has gained a large amount of interest in recent years. Automatic syntactic analysis for the creation of deeper language models is potentially very useful for such methods. However, syntactic parsers that are tailored to accommodate for the distinctive properties of clinical language are rare and costly to build. We present an initia...
متن کاملGraph parsing with s-graph grammars
A key problem in semantic parsing with graph-based semantic representations is graph parsing, i.e. computing all possible analyses of a given graph according to a grammar. This problem arises in training synchronous string-to-graph grammars, and when generating strings from them. We present two algorithms for graph parsing (bottom-up and top-down) with s-graph grammars. On the related problem o...
متن کاملRelational Database with Sql and Graph Database
This paper represents the study of various theory based on database model. In this paper a study of various papers is done, and in the reviewed paper graph database and SQL is done. This huge repository of unstructured data has resulted in making the data search and knowledge extraction, a very cumbersome task if one continues using the legacy relational database.
متن کاملSnipSuggest: Context-Aware Autocompletion for SQL
In this paper, we present SnipSuggest, a system that provides onthe-go, context-aware assistance in the SQL composition process. SnipSuggest aims to help the increasing population of non-expert database users, who need to perform complex analysis on their large-scale datasets, but have difficulty writing SQL queries. As a user types a query, SnipSuggest recommends possible additions to various ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i11.26536